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Large area yield forecasting for the United States and foreign 
areas is an application of plant growth simulation models with great 
potential. Forecasts from these models could be a powerful tool for 
agricultural and economic policy makers in both government and industry. 

Daily solar radiation data are required in most models. Until 
recently, solar radiation had been measured at only a few locations 
around the world. Because of the lack of available data, various surro- 
gates for measured solar radiation are being developed and tested for 
use in the models. Recently, satellite estimated solar radiation became 
available for most of the Western Hemisphere on a ''real-time'* basis. If 
satellite data can be used to produce accurate solar radiation estimates 
one obstacle to using simulation models for large areas or many loca- 
tions would be removed. 

Calibration of a particular crop growth simulation model would be 
needed for area-specific factors: soil fertility, water holding capa- 

city, varietal characteristics, fertilizer and pesticide applications, 
planting practices, and other management practices. This calibration 
also requires historical yield and meteorological data including solar 
radiation data for that area. Because historical solar radiation data 
tare not generally available, a surrogate must be used. In this paper 
five algorithms are compared for producing solar radiation surrogates 
from commonly measured daily meteorological variables. 

As used in this paper, a solar radiation "surrogate" will produce 
solar radiation data which is similar to observed data in terms of 
various statistical measures: similar daily mean, similar variability, 

etc. These surrogates are not intended to accurately predict observed 
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1 data on any cjiven day. The term “estimate" is reserved for accurate 

2 daily predictions, and estimates of yearly yield. 

3 SOLAR RADIATION ALGORITHMS 

4 Five algorithms producing surrogate measures of solar radiation 

5 are compared in this study. They are referred to as CE, SR, RO, Rl, and 
5 R2. The CE algorithm (Cengiz et al, 1981) was developed using data from 

7 Columbia, Missouri. It is composed of two types of functions. Location 

8 specific functions require information on latitude. Daily functions 

9 require the day of the year and daily maximum and minimum temperature 

10 (Table la). 

11 The SR, RO, Rl, and R2 algorithms are based on the Richardson 
12(1981) weather simulation model (Table lb, c, d, e). The Richardson 

13 model uses a set of location specific constants to estimate daily rain- 

14 fall. solar radiation, and maximum and minimum temperature. These 

15 constants would be available only for locations in the continental 

16 United States. The solar radiation and temperature values are estimated 

17 as daily deviations from annual curves. The annual curves consist of 

18 long term average daily values. Separate curves are used for dry days 

19 and for rainy days. Rainy days are defined as those days for which 

20 rainfall has been estimated as being greater than zero. The algorithms 

21 used in this study modified Richardson's model so that observed tempera- 

22 tures and rainfall were used to estimate solar radiation. 

23 The SR algorithm (Table lb) was based only on Richardson's annual 

24 curves for normal radiation. Separate curves were used for dry days and 

25 for rainy days. There were no daily deviations from the annual curves. 

26 To estimate daily deviations from the annual curve values for solar 

27 radiation, temperature, and precipitation, the Richardson model uses 
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correlations, one day lag correlations and a random component. The 
correlations and one day lag correlations were reported to be approxi- 
mately uniform for the continental U.S. (Richardson, 1981). It may be 
acceptable to extend these correlations to other regions. In the RO 
algorithm, the actual deviations of maximum and minimum temperatures and 
the correlations were used to estimate the daily deviation of solar 
radiation (Table lc). 

The R1 and R2 algorithms also use Richardson's correlations among 
daily deviations of temperature and solar radiation. These correlations 
and the actual daily deviations of maximum and minimum temperatures are 
used to produce daily deviations of solar radiation in the R1 algorithms 
(Table Id). 

Because the daily variability of solar radiation estimated by R1 
was too small, the daily deviations of 5 were amplified for the R2 
algorithm (Table le). The amplification was moderate for deviations 
above the annual curves but greater for deviations below the annual 
curves. Measured data (DOE, 1979) from St. Cloud, MN; Rapid City, SD; 
and Glasgow, MT were used to determine the degree of this amplification. 
Richardson's annual mean values of solar radiation were changed by the 
amplification. The new "annual mean values" were approximately 5% 
greater than the actual values for dry days and 15% greater for rainy 
days. 

DATA 

The SOLDAY (DOE, 1979) data set encompasses the period 1952-1974. 
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It consists of measured and rehabilitated (adjusted for known procedural 
and instrumental errors) daily solar radiation values and associated 
maximum and minimum daily temperatures and rainfall for 27 U.S. 


I 



1 itations. The rehabilitated solar radiation data were viewed ^s 

2 ‘ground truth" for the purposes of this study. Seven stations (Table 2) 
*ere selected to compare the five algorithms producing solar radiation 

I surrogates for the rehabilitated solar radiation values. Three of the 
5 seven stations (St. Cloud, Rapid City, and Glasgow) had been used in the 
j level opment of the R2 algorithm. Seven surrogate data sets, one for 
reach station, were developed for comparison with ground truth (Table 2). 
j When the rehabilitated and simulated solar radiation values were 
compared over the seven stations, several things became apparent. Some 
i of the rehabilitated solar raliation values were obviously too large 
(greater than 85% of solar radiation at the top of the atmosphere). The 
; R2 and the CE algorithms also occasionally estimated excessively high 
lvalues, the CE algorithm more so than R2. 

The SOLDAY (DOE, 1979) data also included daily values of solar 
radiation at the top of the atmosphere and the percent possible 
transmissivity of the atmosphere for each location. 

For use with the yield model, the rehabilitated solar radiation 
values, the R2 algorithm values, and the CE algorithm values were 
screened for values greater than maximum potential. This maximum was 
defined to be the product of percent possible atmospheric transmissivity 
(%T) and solar radiation at the top of the atmosphere (ETSR). Values 
higher than this maximum were reduced to the maximum. This screening 
algorithm reduced the observed solar radiation values by an average of 
0.7%, the R2 algorithm values by an average of 0.5%, and the CE algorithn 
values by an average of 1.0%. Values from the R0, Rl, and SR algorithms 
were not affected by the screening. 






COMPARISON METHODOLOGY 

The surrogates of solar radiation could themselves be compared to 
i the rehabilitated ground truth data to determine which was the best 
estimate. These surrogates were developed solely for use in simulation 
models* however. Because of this, it was felt that their impact on 
yield prediction in the models would be the important criterion. The 
best algorithm would not necessarily be the one which produces the best 
g estimates of ground truth but rather would be the one which produces 
g Similar yield predictions when used in a simulation model. 

10 The Ceres-wheat model (Ritchie and Godwin, 1983; Otter, Ritchie and 
U Godwin, 1983) was the model selected for comparison of the five solar 

12 surrogates. This program requires initial parameter values for initial 

13 soil water content, soil water retention characteristics, variety of 

14 wheat ( Triticum aestivum L.) planting density and depth, planting date, 

15 and latitude. 

15 Model estimates were derived using data from three of the SOLDAY 

17 stations: St. Cloud, Minnesota; Rapid City, South Dakota; and Glasgow, 

18 Montana. Both continuous cropping and summer fallowing practices were 

19 used. Median planting dates for each year at each station were estima- 

20 ted using a spring small grains planting date model (Hodges and Artley, 

21 1981). Daily values of rainfall and maximum and minimum temperature 

22 were required for each station. 

23 The solar radiation input was first supplied by the rehabilitated 

24 ground truth data. Five additional model estimates were generated with 

25 identical inputs for all variables except solar radiation. For each 

26 of these, data estimated using one of the solar radiation algorithms 

27 w ere used. The model was also run using unscreened values from the 
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ground truth solar radiation and from the R2 and CE algorithms. On the 
average, yields were reduced by less than 1% compared to model estimates 
using screened data. 

The resulting predicted yields using each of the solar surrogates 
could then be compared to yields predicted using the ground truth data. 
The algorithm which led to results most similar to that using ground 
truth data could then be determined. The sensitivity of the model to 
variations between the algorithms could also be studied. 

For the algorithm comparison, the yearly difference (D) between 
ground truth yield predictions (6TY) and predictions using each surro- 
gate (EST) would be calculated: 

D = 6TY - EST. 

The arithmetic mean of D would indicate the bias of the yield 
estimates. Smaller bias measures would imply better surrogates. Bias 
values would be calculated for each algorithm for each station for 
continuous cropping and for summer fallow, a total of thirty values. 

It would also be important for the root mean square error, RMSE, to 
be small to indicate that more estimates have a small 0 value than a 
large one. This statistic is calculated by: 

RMSE = 

The standard deviation of the 0 values (SD) is also calculated. 

This indicates what the RMSE would be if the bias were removed. 

Maximum values of D (MAX D) and minimum values of D (MIN D) would 
also be compared. As a final measure of the similarity between GTY and 
EST, the Pearson correlation coefficient, CORR, would be determined. 
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COMPARISON RESULTS 

Statistics used for comparison of yield predictions using each of 
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the five solar radiation surrogates are shown in Table 3. Mean and 
standard deviations of yield predictions using the rehabilitated solar 
radiation data (GT) were compared to predictions using each solar 
surrogate. 

In half of the cases, R2 produced yield estimates most similar to 
those using ground truth data. SR was nearest in one third of the cases. 
The CE algorithm had the highest bias generally. The SR algorithm had 
a standard deviation of the estimates nearest to ground truth. RO was 
the poorest in terms of similar variation. 

The RMSE values indicate that the R2 algorithm produced yield 
estimates which had less variation around ground truth, followed by CE. 

RO again was the poorest. If the bias were removed, the SD values show 
that the CE algorithm would have been least variable. RO was, again, 
the poorest. The bias, being a function of the model's sensitivity to 
solar radiation, would be difficult to remove. 

The range of the data, shown by MAX D and MIN D, indicates that for 
St. Cloud, MN summer fallow, all of the algorithms except R2 produced 
estimates which were too low in all of the years. For other areas, the 
ranges are comparable. 

The correlation values indicate the closest correspondence between 
CE yield estimates and ground truth. RO and R2 did poorest using this 
criterion. 

Although "best" and "worst" surrogates could be detected, all were 
very close. Each would be acceptable in terms of the correlation of 
their predicted yields with those yields predicted using the 
rehabilitated solar radiation data. Differences between indicators for 
summer fallowed and continuous cropped were generally negligible. 
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DISCUSSION 

The magnitude of day to day variability of so 'a" radio i in would be 
more critical than day to day accuracy. This is due to the strong 
effect of solar radiation on the modeled soil water balance. When ample 
soil moisture is available, evaporation occurs at an "energy-limited" 
rate proportional to the energy available from solar radiation. When 
the modeled water content of the soil surface is depleted more than a 
certain amount (U), direct evaporation of water from the profile 
(excluding transpiration) occurs at a rate roughly proportional to the 
square root of the number of days on which drying has occurred. This 
"time-limited" evaporation rate is generally much lower than the 
energy-limited evaporation rate. On rainy days, 4/5 of the rainfall is 
available for evaporation at the "energy- limited" rate even if the sur- 
face water depletion is greater than U. On the next dry day, moisture 
that has entered the profile is evaporated at the "time-limited" rate if 
the surface water depletion is greater than U. Consider that in a dry 
situation when a small amount of rainfall occurs, moderate or high 
solar radiation will cause near total evapi -ation. However, low solar 
radiation will allow most of the rain to enter the soil profile and be 
subject to "time-limited" evaporation. Thus, on two days with small 
amounts of rainfall, daily solar radiations of 700 and 100 langleys 
respectively would allow considerably more water to enter the profile 
than would two days of 400 langleys each. 

In the Ceres-wheat model, carbon fixation is affected by solar 


27 


radiation ’n a nonlinear fashion. For radiation amounts to 467 
langleys/duy of intercepted light, carbon fixation is proportional to 
light. At higher light intensities, no additional carbon is fixed. 



Uniformly moderate solar radiation (as opposed to highly variable solar 
2 radiation) will result in more biomass accumulation, more leaf growth, 

2 more water use, and under moist conditions, higher yield. However, with 
I dry conditions, more water stress and lower yield will result, 
j CONCLUSIONS 

, Although the differences between algorithms were small, the bias 

7 and root mean square error indicated that the R 2 algorithm would be 

8 recommended as a solar radiation surrogate for use in simulation models, 
g When used in the Ceres-wheat yield mode), the R 2 solar radiation surro- 

10 gate produced yield predictions closest to those using ground truth 
22 solar radiation data. The CE algorithm also produced close estimates, 

22 but had a larger bias which would be difficult to remove as it is a 

23 function of the model's sensit vity to solar radiation. 

24 The R 2 algorithm would also be recommended for l e foreign areas. 

2 5 The location specific coefficients for the R 2 algorithm can be derived 

26 from long term average monthly solar radiation values; these would be 

27 available world-wide (de Jong, 1973 ). Only an assumption about the 

28 average difference between solar radiation on rainy days and on dry days 

29 for a location would be needed for use of this surrogate. Because of 

20 this, the R 2 algorithm would be recommended for use in areas for which 

21 neither measured nor satellite estimated solar radiation values are 

22 available. . 
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Table 1. Equations for Solar Radiation Simulation Algorithms 
a. CE Algorithm (Cengiz et al, 1981) 

Solar Radiation * 49.03 + .1 *FIS - 7.26 *DBR 
♦ .06 * FIS * OBR 
Location specific functions: 

S = Sin (Latitude * n/180.) 

T = Tan (Latitude * n/180.) 

C = Cos (Latitude * n/180.) 

SLO = Arcsin ((.5 + .007895/C + .2168875 *T)^2) * 180./ n 
10 SN = Sin ( n* SLD/24. ) 

n A = (S * (46.355 * SLD - 574.3885) + 816.41 * C * SN) 

12 * (.29 * C + .52) 

13 B = (S *(574.3885-1.509 * SLO) - 26.59 * (C*SN) * (.29 * C+.52)) 
Daily Functions: 

15 SI = Sin (2 n/365. * (JULIAN DATE + 10.5) - 1.5708) 

16 FIS = A + B * SI 

17 DBR = (TX - TN)* 5/9 

lg b. SR Algorithm (Richardson. 1981) 

19 Solar Radiation = RM (I) + AR * cos (.0712 * (Julian Date - 172)) 

20 RM (1) = Annual mean solar radiation for dry days 

2 1 RM (2) * Annual mean solar radiation for rainy days 

22 AR = Amplitude of annual solar radiation curve 

23 For dry days, 1=1; for rainy days, I = 2 
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1 

Table 1 (continued) 

2 

c. R0 Algorithm (largely based on Richardson, 1981) 

3 

Solar Radiation = SRL * SRSD ♦ SRBAR 

4 

If Solar Radiation < 0.0 then Solar Radiation - 0.0 

5 

If Solar Radiation > 770. then Solar Radiation = 770. 

6 

location specific constants: 

7 

TXM (1) and TXM (2) * mean annual maximum daily temperatures for 

6 

dry days (1) and wet days (2). 

9 

ATX = Amplitude of annual curves (dry day and wet day) daily 

10 

maximum temperature 

11 

CVTX = coefficient of variation of daily deviations of maximum 

12 

temperature from annual curves 

13 

ACVTX * coefficients of variation of AiX 

14 

TNM = mean annual daily minimum temperature 

15 

ATN = Amplitude of annual curve of daily minimum temperature 

16 

CVTN = coefficient of variation of daily deviations of minirum 

17 

temperature from annual curve 

18 

ACVTN = coefficient of variation of ATN 

19 

RM (1) and (2) * mean annual daily solar radiation for dry days 

20 

(1) and rainy days (2) 

21 

AR = amplitude of annual curves of daily solar radiation 

22 

CVR (1) and (2) = coefficients of variation of daily deviations 

23 

of solar radiation from annual curves for dry days (1) and 

24 

for rainy days ACVR (1) and (2) = coefficients of variation 

25 

of AR for dry days (1) and for wet days (2) 

26 
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Table 1 (continued) 

Daily functions: 

SRSD = A8S (SRBAR * (CVR (I) + ACVR (I) * DR)) 

SRBAR = RM (1) ♦ AR * DR 

A and b are matrices (3 x 3) derived by Richardson to describe the 
intercorrelations between daily maximum and minimum temperatures 
and solar radiation in the continental United States. 

I = 1 for dry days or I = 2 for rainy days 

ASRL = A (3,1) * PTXI. ♦ A (3,2) * PTNL + A (3,3) * PSRL 

where PTXI, PTNL, and PSRL are TXL, TNL, and SRL values from 
the previous day 

DT = Cos (.0172 * (Julian date - 200)) 

DR = Cos (.0172 * (Julian date - 172)) 

TXBAR = TXM ( I ) + ATX * .DT 

TXSD = ABS (TSBAR * (CVTX + ACVTX * DT)) 

The above equations are from the Richardson (1981) weather simulation. 

The following five equations were developed to adapt the weather 

simulator to estimate only solar radiation: 

SRL = ASRL + B (3,1) * TXL + B (3,2) * TNL 

TXL * (TX - TXBAR)/TXSD 

If TXL > 1.5 or TXL < -1.5 then TXL * 0.0 

TNL = (TN - TNBAR)/TNSD 

If TNL > 1.5 or TNL < -1.5 then TNL = 0.0 

. R1 Algorithm 

Same as RO algorithm except: 

SRSD * .25 * SRBAR 

TXSD = 9. 


TNSD = 9. 





Table 1 (continued) 

R2 Algorithm 

Same as RO algorithm except: 

Solar Radiation = Noise * SRI * SRSD + SRBAR 
SRSO * .1 * SRBAR 
TXSD = 14. 

TNSO = 14. 

where Noise = 4.4 for SRI > 0.0 on dry days 
* 11.44 for SRL >_ 0.0 on rainy days 
= 13.2 for SRL < 0.0 on dry days 
= 34.32 for SRL <_ 0.0 on rainy days 
If Solar Radiation > 770.1 y/day then Solar Radiation = 770. 

RM (1) and RM (2) should be approximately 5% and 15% greater than 
the actual annual mean daily solar radiation for dry days and 
rainy days respectively. 
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tatistics Tor comparison of solar radiation estimates to ground 
measured solar radiation (6T) 
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Table 

3. Statistical Comparison of Yield Estimates 
from the CERES-Wheat Model Using the 
"Ground Truth" Data (GT) and Estimates 
Using Each of the Algorithms for Solar 
Radiation Surrogates in the Model 
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6 


MEAN 

BIAS 

STD 

RMSE 

SD 

MAX D 

MIN D 

C0RR 

7 

Summer Fallow 









8 

GT 

3193 

— 

601 

-- 

-- 

-- 

-- 

1.000 

9 

RC 

3574 

-381 

625 

445 

237 

-5 

-1051 

.926 

10 

R 1 

3591 

-398 

630 

461 

238 

-40 

-1084 

.926 

11 

R2 

3078 

115 

672 

263 

242 

690 

-250 

.934 

12 

CE 

3607 

-253 

629 

288 

142 

-15 

-615 

.919 

13 

SR 

3446 

-414 

587 

480 

248 

-45 

-1121 

.972 


14 Continuous Crop 
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Table 3 (continued) 
Glasgow, MT 
n * 22 



1 

2 
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MEAN 

BIAS 

STD 

RMSE 

SD 

MAX D 

MIN 0 

CORR 

6 

Summer Fallow 
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GT 

1474 

-- 

1200 

-- 

— 

— 

— 

1.000 

8 

RO 

1410 

64 

1330 

416 

421 

1652 

- 330 

.950 

9 

R1 

1422 

53 

1338 

404 

410 

1645 

- 457 

.954 

10 

R2 

1618 

-143 

1207 

467 

455 

970 

-1480 

.928 

11 

CE 

1403 

99 

1323 

420 

418 

1358 

- 749 

.951 

12 

SR 

1375 

72 

1247 

410 

414 

1688 

- 332 

.942 

13 

Continuous Crop 
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GT 

780 

— 

951 

— 

— 

— 

— 

1.000 

15 

RO 

805 

- 25 

1038 

275 

280 

591 

-707 

.964 

16 

R1 

799 

- 20 

984 

253 

258 

756 

-608 

.965 

17 

R2 

881 

-101 

1038 

357 

350 

1083 

-735 

.942 

18 

CE 

799 

- 20 

1030 

238 

243 

505 

-648 

.973 

19 

SR 

767 

13 

960 

223 

227 

679 

-351 

.972 


20 

21 

22 

23 

24 

25 

26 


27 






